Abstract

For a two user cooperative orthogonal frequency division multiple access (OFDMA) system with 
full channel state information (CSI), we obtain the optimal power allocation (PA) policies which 
maximize the rate region achievable by a channel adaptive implementation of inter-subchannel block 
Markov superposition encoding (BMSE), used in conjunction with backwards decoding. We provide 
the optimality conditions that need to be satisfied by the powers associated with the users' codewords 
and derive the closed form expressions for the optimal powers. We propose two algorithms that can be 
used to optimize the powers to achieve any desired rate pair on the rate region boundary: a projected
subgradient algorithm, and an iterative waterfilling-like algorithm based on Karush-Kuhn-Tucker (KKT)
conditions for optimality, which operates one user at a time and converges much faster. We observe
that utilizing power control to take advantage of the diversity offered by the cooperative OFDMA
system not only leads to a remarkable improvement in achievable rates, but may also help determine
how the subchannels have to be instantaneously allocated to various tasks in cooperation.
I. Introduction

The ability of OFDMA to cope with both intersymbol and interuser interference, combined
with its low complexity of implementation, has made it a popular choice for the next generation
wireless networks. As a result, the problem of resource allocation in OFDMA systems was 
studied extensively in the literature. One example is 0, where it was proved that in an OFDMA 
uplink system, allocating subcarriers to the users with the maximum marginal rate is a necessary 
condition for maximizing the system throughput. A similar problem was solved in using 
KKT conditions, by optimizing a utility function which was assumed to be a function of the 
rates. In [4], a low-complexity algorithm for subcarrier, power, and rate allocation for OFDMA 
was proposed, to maximize the sum rate under individual rate constraints to guarantee fairness. 
The downlink ergodic sum rate maximization problem was considered in 0, where the authors 
developed a linear complexity subcarrier and power allocation algorithm. These works, as well
as many others on OFDMA, naturally assume orthogonal multiple access, thereby choosing to 
avoid interference. However, like all orthogonal transmission techniques, OFDMA incurs some 
rate penalty. Moreover, "interference" in wireless channels is in fact free side information, and 
not ignoring it opens up the possibility of user cooperation. Therefore, in this paper, we focus 
on resource allocation for a two user OFDMA channel, which allows for mutual cooperation 
among the users over each subchannel, each taking into account the available side information. 

The overheard information in a typical wireless multiple access channel (MAC) is captured
by modeling the system as a MAC with generalized feedback (MAC-GF) BH. In Q, achievable 
rates for the MAC-GF were obtained based on BMSE and backwards decoding. In 0, these 
encoding and decoding techniques were applied to a Gaussian MAC in fading, and the resulting 
rate regions were characterized. In [8], PA policies that maximize the rates achievable by BMSE
for the same model were obtained. 

While the above works all deal with a scalar MAC-GF, some works on resource allocation for 
user cooperation in vector channels, specifically OFDMA, also exist. A cooperative OFDMA 
system where each user is allowed to transmit and receive at the same time, but necessarily 
on different subcarriers, was considered in [9]. Subcarrier and PA schemes for a time division 
duplex amplify and forward protocol were employed in [10] with the aim of maximizing system
throughput and enhancing fairness in a cooperative OFDMA uplink system. Resource allocation 
and cooperative partner selection in cooperative OFDM networks were investigated with the
objective of minimizing the overall power in [11]. In [12], power allocation for an OFDM
based two-way relay channel using physical network coding was considered. However, these works
consider either a one sided cooperation strategy, or a mutually cooperative strategy based on two 
parallel dedicated relay channels, or mutual cooperation based on a time division protocol. 

In this paper, we instead consider a more general cooperative OFDMA model recently introduced in
[13]. This model is based on parallel MAC-GFs, and does not make any prior assumptions
about the way in which the subchannels are assigned to the users. We extend the two full-duplex
cooperative encoding strategies of [13], namely intra-subchannel cooperative encoding (IntraSCE) and
inter-subchannel cooperative encoding (InterSCE), to a channel adaptive scenario. The
main contributions are (i) the characterization of the long term achievable rate region for a
two user cooperative OFDMA system with power control; (ii) the analytical derivation of the 
optimal PA policy that results in the best known achievable rate for the non-orthogonal mutually 
cooperative scenario; (iii) the development of two algorithms which obtain the optimal PA, 
and (iv) the evaluation of the achievable rate region under several scenarios, including limited 
CSI feedback. We first obtain the properties of the PA policy that maximizes the sum rate 
of the cooperative OFDMA system employing IntraSCE and InterSCE. Despite the complex 
re-encoding structure employed in InterSCE, the achievable rate region turns out to be of a 
relatively similar form to its scalar counterpart, and we are able to extend some properties of 
the optimal PA derived in [8] for a scalar cooperative MAC, to cooperative OFDMA. As a
result, the weighted sum of rates, which can be used to obtain any point on the rate region 
boundary, becomes concave, and convex optimization techniques can be employed. We first 
propose a projected subgradient algorithm that converges to the optimum and maximizes the 
achievable rate region. Next, we derive the optimality conditions, and closed form expressions 
for optimum powers analytically. We are then able to propose an alternative efficient iterative 
algorithm with a much lower complexity, to obtain the rate points on the achievable rate region 
boundary. This algorithm works by solving the KKT optimality conditions iteratively over the 
users, to obtain the optimal powers. As a result, we demonstrate that by jointly exploiting the 
diversity provided by OFDMA's parallel subchannels, and the temporal diversity created by the 
time varying channel, we obtain very promising gains in achievable rates. More interestingly, 
we observe that the optimal PA may automatically dictate that some subchannels are assigned 
exclusively to certain users/tasks, depending on the instantaneous channel state, and that, even 
with limited CSI feedback from the receiver, the improvement in the rate region is still significant. 



II. System Model 

We consider a two user full-duplex cooperative OFDMA system with $N$ subchannels, which
is shown in Fig. 1 and is modeled by

$$Y_0^{(i)} = h_{10}^{(i)} X_1^{(i)} + h_{20}^{(i)} X_2^{(i)} + Z_0^{(i)}, \tag{1}$$

$$Y_1^{(i)} = h_{21}^{(i)} X_2^{(i)} + Z_1^{(i)}, \tag{2}$$

$$Y_2^{(i)} = h_{12}^{(i)} X_1^{(i)} + Z_2^{(i)}, \tag{3}$$

where, for each subchannel $i \in \{1, \dots, N\}$, $X_k^{(i)}$ is the symbol transmitted by node $k$, $Z_l^{(i)}$ is
the zero-mean additive white Gaussian noise at node $l$, with variance $\big(\sigma_l^{(i)}\big)^2$; $h_{kl}^{(i)}$ is the fading
coefficient between nodes $k$ and $l$, and $Y_l^{(i)}$ is the symbol received at node $l$; with $k \in \{1,2\}$,
$l \in \{0,1,2\}$ and $k \neq l$. Here, the receiver is denoted by $l = 0$. To simplify the notation throughout
the paper, we define the normalized power-fading coefficients $s_{kl}^{(i)} = |h_{kl}^{(i)}|^2 / \big(\sigma_l^{(i)}\big)^2$, and the Gaussian
capacity function $C(x) = \frac{1}{2}\log(1 + x)$. For a real number $x$, we define $(x)^+ = \max(x, 0)$.
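The notation above can be illustrated with a short Python sketch. The function names are hypothetical, the normalization $s = |h|^2/\sigma^2$ is taken as the working definition of the power-fading coefficient, and the base-2 logarithm (rates in bits) is an assumption, since the text does not fix the base.

```python
import math


def normalized_gain(h: complex, noise_var: float) -> float:
    """Normalized power-fading coefficient s = |h|^2 / sigma^2 (assumed form)."""
    return abs(h) ** 2 / noise_var


def capacity(x: float) -> float:
    """Gaussian capacity function C(x) = 0.5 * log(1 + x); base 2 is an assumption."""
    return 0.5 * math.log2(1.0 + x)


def pos(x: float) -> float:
    """Positive part (x)^+ = max(x, 0)."""
    return max(x, 0.0)
```

For instance, a link with $|h| = 2$ and noise variance 4 has a normalized gain of 1, and `capacity(3.0)` evaluates to one bit per channel use under the base-2 convention.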

III. Long-term Achievable Rates for Cooperative OFDMA 
We first briefly review the channel non-adaptive IntraSCE and InterSCE strategies proposed
in [13], which shall be extended to obtain our channel adaptive model and rate regions. Both
mutually cooperative strategies are of decode and forward type, and rely on block Markov
superposition encoding at the transmitters, and backward decoding at the receiver. The communication
takes place in $B$ blocks. The message $w_k[b]$ of each user $k \in \{1,2\}$ in block $b$ is divided into two
submessages, $w_{k0}[b]$ and $w_{kj}[b]$, intended to be decoded at the receiver and the cooperative partner
$j \in \{1,2\}$ respectively, which are further divided into $N$ submessages each,

$$w_{k0}[b] = \big\{w_{k0}^{(1)}[b], \dots, w_{k0}^{(N)}[b]\big\}, \qquad w_{kj}[b] = \big\{w_{kj}^{(1)}[b], \dots, w_{kj}^{(N)}[b]\big\}, \tag{4}$$

to be transmitted over disjoint subchannels. In both IntraSCE and InterSCE, the codeword
transmitted by each user $k$ over each subchannel $i$ in block $b \in \{1, \dots, B\}$ is given by

$$X_k^{(i)} = \sqrt{P_{k0}^{(i)}}\, X_{k0}^{(i)} + \sqrt{P_{kj}^{(i)}}\, X_{kj}^{(i)} + \sqrt{P_{Uk}^{(i)}}\, U_k^{(i)}. \tag{5}$$

Here, the component codewords $X_{k0}^{(i)}$, $X_{kj}^{(i)}$ and $U_k^{(i)}$ are all selected from codebooks which are
randomly generated from unit Gaussian distributions. In a given block $b$, the task of $X_{k0}^{(i)}$ is to
transmit fresh information $w_{k0}^{(i)}[b]$ directly intended for the receiver, while the codeword $X_{kj}^{(i)}$ is
used for establishing common information, $w_{kj}^{(i)}[b]$, at the cooperating partner. User $j$ decodes
$w_{kj}[b]$ at the end of block $b$ using $X_{kj}^{(i)}$, and treating $X_{k0}^{(i)}$ as noise. The difference between IntraSCE
and InterSCE lies in the way $U_k^{(i)}$, which is the codeword used for conveying the previously
established common information to the receiver, is mapped to the messages. In IntraSCE, $U_k^{(i)}$ is
used to re-transmit the cooperative submessages, $w_{kj}^{(i)}[b-1]$ and $w_{jk}^{(i)}[b-1]$, received on subchannel
$i$ in block $b-1$, to the destination, over the same subchannel. In the InterSCE strategy, however,
after common information is established at the cooperating partner, the cooperative messages are
re-partitioned, and $U_k^{(i)}$ may be used to transmit submessages received over other subchannels.
Note that, since both users will know $w_{kj}[b-1]$ and $w_{jk}[b-1]$ at the end of block $b-1$, $U_k^{(i)}$ is
commonly known to both users, and does not act as further interference while decoding $w_{kj}^{(i)}[b]$.
The details of the achievability scheme for the channel non-adaptive case can be found in [13].

Note that (5) does not utilize instantaneous CSI to adapt the instantaneous transmission
powers. However, if we assume that the users and the receiver have full CSI of both the
cooperative links and the direct links, the users can further adapt their transmitted symbols
$X_k^{(i)}$ as a function of the joint fading state $\mathbf{s}$, to maximize the long term (ergodic) achievable
rates. In general, there are two ways to perform such channel adaptation: we can either use a
variable power, variable rate codebook, as in [14], or we can use a single codebook, whose rate
is supported by the channel in the long term, and perform the channel adaptation by simply
multiplying entries from this codebook by channel adaptive powers, as in [15]. In this paper, we
employ the latter approach, and propose a channel adaptive version of the encoding strategies
in [13], where we scale each of the codewords in (5) by variable powers,

$$X_k^{(i)} = \sqrt{p_{k0}^{(i)}(\mathbf{s})}\, X_{k0}^{(i)} + \sqrt{p_{kj}^{(i)}(\mathbf{s})}\, X_{kj}^{(i)} + \sqrt{p_{Uk}^{(i)}(\mathbf{s})}\, U_k^{(i)}, \tag{6}$$

where $k, j \in \{1,2\}$, $k \neq j$, $i = 1, \dots, N$. The powers are subject to the average power constraints

$$\sum_{i=1}^{N} E\Big[p_{k0}^{(i)}(\mathbf{s}) + p_{kj}^{(i)}(\mathbf{s}) + p_{Uk}^{(i)}(\mathbf{s})\Big] \le P_k. \tag{7}$$
The achievable rate regions for power controlled IntraSCE and InterSCE are obtained by
extending [13, Corollary 1] and [13, Corollary 2] respectively, using the new channel adaptive
encoding defined in (6). The resulting achievable rate region for IntraSCE with power control
is given by the closure of the convex hull of all rate pairs $(R_1, R_2)$ satisfying (8)-(10),
and the achievable rate region for InterSCE with power control is given by the closure of the
convex hull of all rate pairs $(R_1, R_2)$ satisfying (11)-(13),
where the convex hulls are taken over all valid PA policies. In the next section, we obtain the PA
policies which achieve the rate tuples on the rate region boundary. To do this, we first derive a
simplifying property of optimal PA for both cooperative encoding strategies, and then we focus
on InterSCE, which provides superior achievable rates.



IV. Channel Adaptive Power Allocation 

If we set $N = 1$ in (8)-(10) or (11)-(13), the problem reduces to a scalar cooperative MAC.
In [8], it was shown for this scalar case that, based on the instantaneous channel state, the
optimal PA dictates that each user either sends cooperative information, or fresh information,
but not both. Although in OFDMA there is a sum power constraint over the subchannels, and
one would expect the PA over each subchannel to depend on the powers assigned to the
other subchannels, we show that many properties of the optimal PA for the proposed cooperative
OFDMA system remain surprisingly parallel to those in the scalar case [8], and the codewords
that should be used over each subchannel are determined solely by the instantaneous fading
coefficients over that particular subchannel, as stated in the following lemma:

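Lemma 1: The PA policy that maximizes the sum rate of a cooperative OFDMA system using
IntraSCE and InterSCE should satisfy

1) $p_{10}^{(i)*}(\mathbf{s}) = p_{20}^{(i)*}(\mathbf{s}) = 0$, if $\mathbf{s} \in \mathcal{S}_1$,

2) $p_{10}^{(i)*}(\mathbf{s}) = p_{21}^{(i)*}(\mathbf{s}) = 0$, if $\mathbf{s} \in \mathcal{S}_2$,

3) $p_{12}^{(i)*}(\mathbf{s}) = p_{20}^{(i)*}(\mathbf{s}) = 0$, if $\mathbf{s} \in \mathcal{S}_3$,

4) $p_{12}^{(i)*}(\mathbf{s}) = p_{21}^{(i)*}(\mathbf{s}) = 0$ or $p_{10}^{(i)*}(\mathbf{s}) = p_{21}^{(i)*}(\mathbf{s}) = 0$ or $p_{12}^{(i)*}(\mathbf{s}) = p_{20}^{(i)*}(\mathbf{s}) = 0$, if $\mathbf{s} \in \mathcal{S}_4$,

where $\mathcal{S}_1 = \{\mathbf{s} : s_{12}^{(i)} > s_{10}^{(i)},\ s_{21}^{(i)} > s_{20}^{(i)}\}$, $\mathcal{S}_2 = \{\mathbf{s} : s_{12}^{(i)} > s_{10}^{(i)},\ s_{21}^{(i)} \le s_{20}^{(i)}\}$,
$\mathcal{S}_3 = \{\mathbf{s} : s_{12}^{(i)} \le s_{10}^{(i)},\ s_{21}^{(i)} > s_{20}^{(i)}\}$ and $\mathcal{S}_4 = \{\mathbf{s} : s_{12}^{(i)} \le s_{10}^{(i)},\ s_{21}^{(i)} \le s_{20}^{(i)}\}$.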

Proof: Assume that we know the total optimal power $p_k^{(i)*}(\mathbf{s})$ allocated to each subchannel
$i$ at each channel state $\mathbf{s}$. Then, for IntraSCE, the sum rate (10) is maximized if each term in
the summation is maximized. Since the total power allocated to each term is fixed, we have $N$
independent optimization problems, and the result follows by [8, Proposition 1]. For InterSCE,
the sum rate (13) is maximized if each argument of the minimum operation is maximized. The
first argument of (13) is insensitive to the choice of $p_{k0}^{(i)*}(\mathbf{s})$ or $p_{kj}^{(i)*}(\mathbf{s})$, as long as their sum is
fixed, whereas the second argument is maximized if we separately maximize its summands for
each $i$. This is again equivalent to $N$ independent optimization
problems, each yielding a scalar case to which [8, Proposition 1] applies, giving the desired result. ■
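The per-subchannel decision in Lemma 1 amounts to two comparisons. The sketch below is a hypothetical helper (the function and label names are not from the paper) that maps the four normalized gains of one subchannel to the region index and to the pair of powers set to zero; for region 4 it assumes the first listed option, $p_{12} = p_{21} = 0$, which is the choice the paper adopts in its footnote.

```python
from typing import Tuple

# Powers forced to zero in each region, following Lemma 1.
# Region 4 uses the first option of the lemma (no cooperative codewords).
ZEROED = {
    1: ("p10", "p20"),
    2: ("p10", "p21"),
    3: ("p12", "p20"),
    4: ("p12", "p21"),
}


def classify_state(s12: float, s10: float, s21: float, s20: float) -> int:
    """Return d such that the subchannel state lies in S_d."""
    user1_cooperates = s12 > s10  # inter-user link of user 1 beats its direct link
    user2_cooperates = s21 > s20  # inter-user link of user 2 beats its direct link
    if user1_cooperates and user2_cooperates:
        return 1
    if user1_cooperates:
        return 2
    if user2_cooperates:
        return 3
    return 4


def zeroed_powers(region: int) -> Tuple[str, str]:
    """Labels of the two data powers that Lemma 1 sets to zero in region d."""
    return ZEROED[region]
```

Because the classification uses only the gains of the subchannel in question, it can be evaluated independently for each $i$, which is the decoupling that the lemma establishes.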

An important observation is that setting two of the powers equal to zero as suggested by
Lemma 1 is also optimal for the entire rate region maximization, as the right hand sides of all
three constraints, for both policies, are maximized by choosing the powers according to Lemma
1.¹ Therefore, from now on we focus only on policies that satisfy Lemma 1.

Note that the bounds (11), (12) and (13) on $R_1$, $R_2$ and $R_1 + R_2$ respectively for InterSCE are
looser than the corresponding bounds (8), (9) and (10) for IntraSCE, as the minimum operations
in (8) and (9) are removed, and the minimum in (10) is taken outside the summation, to obtain (11),
(12) and (13). As a result, the achievable rate region of InterSCE contains that of IntraSCE.
Hence, it is sufficient to limit our focus to the InterSCE strategy, which results in a uniformly better
rate region. Then, it is easy to check that the rate constraints in (11)-(13) now become concave
in the power vector $\mathbf{p}(\mathbf{s})$, which collects the powers of all codewords over the $N$ subchannels,
lending themselves to well known techniques in convex optimization, which we discuss in the
next sections.

¹ We choose the first option for $\mathbf{s} \in \mathcal{S}_4$, which may cause a slight deviation from optimality for the sum rate. However, this
case rarely occurs in practice, and this suboptimality can be ignored, as was done in [8].

A. Achievable Rate Maximization Using Projected Subgradients 

Since all bounds of the achievable rate region are concave in the powers, so is any weighted
sum $\mu_1 R_1 + \mu_2 R_2$ at the corners. Moreover, it is easy to show that the rate region is strictly
convex [8], [15]. Therefore, we can obtain points on the rate region boundary by maximizing
$R_\mu = \mu_1 R_1 + \mu_2 R_2$, where $(R_1, R_2)$ is the corner of the pentagon obtained for a given PA policy,
defined by (11)-(13). Assuming $\mu_1 \ge \mu_2$ without loss of generality, and employing Lemma 1 to
simplify (11)-(13), the optimization problem (14) is to maximize $R_\mu$ over the PA policy $\mathbf{p}(\mathbf{s})$,
subject to the average power constraints (7) and

$$p_{k0}^{(i)}(\mathbf{s}),\ p_{kj}^{(i)}(\mathbf{s}),\ p_{Uk}^{(i)}(\mathbf{s}) \ge 0, \qquad k, j \in \{1,2\},\ k \neq j,$$

where, in the objective of (14), $E_{\mathcal{S}_d}$ denotes the expectation over $\mathbf{s} \in \mathcal{S}_d$, $d = 1, 2, 3, 4$.

Due to the minimum operation in (14), the gradient of the objective function does not exist
everywhere. In particular, there are two gradient vectors, depending on which argument of the
minimum in (14) is active. Yet, these vectors may be viewed instead as subgradients, which
makes it possible to employ the method of projected subgradients for power optimization. Due
to the convex nature of our constraints, this method is guaranteed to converge to the global
optimum [16], with a diminishing stepsize normalized by the norm of the subgradient.
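To make the mechanics concrete, here is a minimal Python sketch of a projected subgradient ascent on a toy problem with the same structure: a concave objective given by the minimum of two sums of logarithms, a total power budget, and non-negativity. The toy rate functions, the names `project` and `maximize_min_rate`, and the step constant `c` are assumptions for illustration only; the sketch does not reproduce the objective (14).

```python
import math
from typing import List, Sequence, Tuple


def project(v: Sequence[float], budget: float) -> List[float]:
    """Euclidean projection onto {p >= 0, sum(p) <= budget}."""
    clipped = [max(x, 0.0) for x in v]
    if sum(clipped) <= budget:
        return clipped
    # Budget violated: project v onto the simplex {p >= 0, sum(p) = budget}.
    css, theta = 0.0, 0.0
    for j, uj in enumerate(sorted(v, reverse=True), start=1):
        css += uj
        shift = (css - budget) / j
        if uj - shift > 0:
            theta = shift
    return [max(x - theta, 0.0) for x in v]


def rate(p: Sequence[float], gains: Sequence[float]) -> float:
    """Toy concave rate: sum of log(1 + g_i * p_i), in nats."""
    return sum(math.log(1.0 + g * x) for g, x in zip(gains, p))


def rate_grad(p: Sequence[float], gains: Sequence[float]) -> List[float]:
    return [g / (1.0 + g * x) for g, x in zip(gains, p)]


def maximize_min_rate(a: Sequence[float], b: Sequence[float], budget: float,
                      iters: int = 2000, c: float = 0.5) -> Tuple[List[float], float]:
    """Maximize min(rate(p, a), rate(p, b)) over the feasible power set."""
    p = [0.0] * len(a)
    best_p, best_val = list(p), min(rate(p, a), rate(p, b))
    for t in range(1, iters + 1):
        # A subgradient of the minimum is the gradient of the active argument.
        active = a if rate(p, a) <= rate(p, b) else b
        g = rate_grad(p, active)
        norm = math.sqrt(sum(x * x for x in g))
        step = c / math.sqrt(t) / norm  # diminishing, normalized step size
        p = project([x + step * gx for x, gx in zip(p, g)], budget)
        val = min(rate(p, a), rate(p, b))
        if val > best_val:  # subgradient steps are not monotone; keep the best
            best_p, best_val = list(p), val
    return best_p, best_val
```

With mirrored gains such as `a = [2.0, 0.5]`, `b = [0.5, 2.0]` and a budget of 2, the iterates hover around the symmetric allocation, and the best value approaches $\log 4.5$ only slowly, which illustrates the convergence behavior discussed next.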

Since the calculation of the subgradients requires rather tedious formulas which give little
insight, we will directly provide some examples of the achievable rate region, and the resulting
PA policy, based on simulations in Section V instead. The major drawbacks of the subgradient
algorithm are its slow rate of convergence and its complexity. As the number of subchannels
increases, so does the size of the vector of power variables, making the computation of
the subgradients and the projection operations formidable. Hence, we next obtain analytical
expressions for the weighted sum-rate optimal power control, and propose an alternative iterative
algorithm which converges much faster than the subgradient algorithm.



B. Iterative Achievable Rate Maximization Based on KKT Conditions 

The optimization problem (14) can be stated in the equivalent differentiable form (15)-(19).

Note that, due to the concavity of the logarithm, (15)-(19) is a convex optimization problem
with differentiable constraints, and hence the KKT conditions are necessary and sufficient for
optimality. Assigning the Lagrange multipliers $\gamma_1, \gamma_2, \lambda_1, \lambda_2$ to the constraints (15)-(18), and
$\epsilon_t^{(i)}(\mathbf{s})$, $t = 1, \dots, 6$, to the positivity constraints (19), we obtain the conditions for optimality,
given in the following lemma.

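Lemma 2: Define the variable $A^{(i)}$, $i = 1, \dots, N$, and the indices $m$, $n$ as follows:

$$A^{(i)} = 1 + s_{10}^{(i)} p_1^{(i)}(\mathbf{s}) + s_{20}^{(i)} p_2^{(i)}(\mathbf{s}) + 2\sqrt{s_{10}^{(i)} s_{20}^{(i)}\, p_{U1}^{(i)}(\mathbf{s})\, p_{U2}^{(i)}(\mathbf{s})}, \tag{20}$$

$$m = \begin{cases} 0, & \text{if } \mathbf{s} \in \mathcal{S}_3 \cup \mathcal{S}_4 \\ 2, & \text{if } \mathbf{s} \in \mathcal{S}_1 \cup \mathcal{S}_2 \end{cases}, \qquad n = \begin{cases} 0, & \text{if } \mathbf{s} \in \mathcal{S}_2 \cup \mathcal{S}_4 \\ 1, & \text{if } \mathbf{s} \in \mathcal{S}_1 \cup \mathcal{S}_3 \end{cases}, \tag{21}$$

where $p_k^{(i)}(\mathbf{s})$ denotes the total power of user $k$ on subchannel $i$.
A power allocation policy $p_{1m}^{(i)}(\mathbf{s})$, $p_{2n}^{(i)}(\mathbf{s})$, $p_{U1}^{(i)}(\mathbf{s})$, $p_{U2}^{(i)}(\mathbf{s})$ is optimal for the problem (15)-(19)
if and only if it satisfies (22)-(24) for $\mathbf{s} \in \mathcal{S}_1 \cup \mathcal{S}_2 \cup \mathcal{S}_3 = \mathcal{S}_4^c$, and (25)-(27) for $\mathbf{s} \in \mathcal{S}_4$,
where the Lagrange multipliers $\gamma_1$, $\gamma_2 = 1 - \gamma_1$, $\lambda_1$ and $\lambda_2$ are selected so that the constraints
(15)-(18) are satisfied with equality. Each of the constraints (22), (23) and (24) (correspondingly
(25), (26) and (27) when $\mathbf{s} \in \mathcal{S}_4$) is satisfied with equality if and only if the respective power
level, $p_{1m}^{(i)}(\mathbf{s})$, $p_{2n}^{(i)}(\mathbf{s})$ or $p_{Uk}^{(i)}(\mathbf{s})$, is positive.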

Proof: See Appendix. ■ 

The optimality conditions given in Lemma 2 for each power component are heavily coupled,
thereby making the computation of the optimal PA policy seemingly difficult. Yet, in the
following theorem, we show that, after some non-trivial observations, the coupling among the
constraints is partially removed, and as a result, we are able to provide closed form expressions
for the optimal power levels.

Theorem 1: For a cooperative OFDMA system employing InterSCE, the optimal PA, $p_{1m}^{(i)}(\mathbf{s})$,
$p_{2n}^{(i)}(\mathbf{s})$, $p_{U1}^{(i)}(\mathbf{s})$, $p_{U2}^{(i)}(\mathbf{s})$, that solves (15)-(19) is given by (28)-(33b),
where $\gamma_1$, $\lambda_1$ and $\lambda_2$ are selected to satisfy the constraints (15)-(18) with equality,
and the function $f(\cdot, \cdot, \cdot)$ is defined as $f(a, b, c) = \left(\frac{-b + \sqrt{b^2 - 4ac}}{2a}\right)^+$.
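The helper function in the theorem can be sketched in a few lines of Python, reading $f(a, b, c)$ as the positive part of the larger root of $a x^2 + b x + c = 0$. Returning zero when the discriminant is negative is an assumption made here to keep the function total; the theorem itself does not address that case.

```python
import math


def f(a: float, b: float, c: float) -> float:
    """Positive part of the larger root of a*x^2 + b*x + c = 0 (a != 0)."""
    disc = b * b - 4.0 * a * c
    if disc < 0.0:
        # Assumption: no real root is treated as zero power.
        return 0.0
    return max((-b + math.sqrt(disc)) / (2.0 * a), 0.0)
```

For example, `f(1, -3, 2)` picks the root 2 of $x^2 - 3x + 2$, while `f(1, 3, 2)` returns 0 because both roots are negative.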

Proof: We start by noting that, to obtain coherent combining gain, the optimal cooperative
powers $p_{Uk}^{(i)}(\mathbf{s})$, $k = 1,2$, over a given subchannel and given channel state $\mathbf{s}$, should either be
both positive, or both zero. Let us first assume that both $p_{U1}^{(i)}(\mathbf{s})$ and $p_{U2}^{(i)}(\mathbf{s})$ are positive. Then,
the constraints (24) (equivalently (27)) should be satisfied with equality for $k = 1,2$. Evaluating
(24) (equivalently (27)) separately for $k = 1,2$, and dividing the resulting equalities, we get
a relation between the two cooperative powers.
Plugging this relation into (24) (equivalently (27)), we arrive at the crucial equality (36).
The significance of (36) is that its left hand side, which involves all power components through
$A^{(i)}$, and appears in all of (22)-(27), can be replaced by a term which depends only on the
fixed Lagrange multipliers, $\lambda_1$ and $\lambda_2$, and the direct link gains, $s_{k0}^{(i)}$. Therefore, the optimality
conditions for $p_{1m}^{(i)}(\mathbf{s})$ and $p_{2n}^{(i)}(\mathbf{s})$ can be rewritten independently of $p_{Uk}^{(i)}(\mathbf{s})$. For example, using
(36) in (22), we get (37),
which yields the waterfilling solution (29a). Similarly, using (36) in (23), (25) and (26), we
obtain (30a), (29b) and (30b) respectively. The expression (28) of the optimal $p_{Uk}^{(i)}(\mathbf{s})$ follows from
(EH), © and (EU). 

Note however that $p_{Uk}^{(i)}(\mathbf{s})$ obtained by (28) is not guaranteed to be positive. In case it is not,
(24) (equivalently (27)) is satisfied with strict inequality, the optimal
$p_{Uk}^{(i)}(\mathbf{s})$ should be set to 0, and (36) can no longer be used. Then, when $p_{Uk}^{(i)}(\mathbf{s}) = 0$, instead
of (22)-(23) and (25)-(26) we have to apply the conditions

$$(\mu_1 - \mu_2 + \gamma_1 \mu_2)\, \frac{s_{1m}^{(i)}}{1 + s_{1m}^{(i)} p_{1m}^{(i)}(\mathbf{s})} + \gamma_2 \mu_2\, \frac{s_{10}^{(i)}}{1 + s_{10}^{(i)} p_{1m}^{(i)}(\mathbf{s}) + s_{20}^{(i)} p_{2n}^{(i)}(\mathbf{s})} \le \lambda_1, \tag{38}$$

and its counterpart (39) for user 2, for $\mathbf{s} \in \mathcal{S}_1 \cup \mathcal{S}_2 \cup \mathcal{S}_3$, and (40)-(41) for $\mathbf{s} \in \mathcal{S}_4$.

When $p_{Uk}^{(i)}(\mathbf{s}) = 0$, $k = 1,2$, the powers $p_{1m}^{(i)}(\mathbf{s})$ and $p_{2n}^{(i)}(\mathbf{s})$ are automatically independent
of $p_{Uk}^{(i)}(\mathbf{s})$. However, (38) and (39), as well as (40) and (41), are coupled, and each pair should be solved by
finding the positive roots of a quadratic equation. Since all power values are non-negative, i.e.,
$p_{1m}^{(i)}(\mathbf{s}) \ge 0$ and $p_{2n}^{(i)}(\mathbf{s}) \ge 0$, we can obtain $p_{1m}^{(i)}(\mathbf{s})$ in (32a) and $p_{2n}^{(i)}(\mathbf{s})$ in (33a) by solving (38)
and (39). Similarly, $p_{1m}^{(i)}(\mathbf{s})$ in (32b) and $p_{2n}^{(i)}(\mathbf{s})$ in (33b) can be obtained using (40) and (41).
$\gamma_1$, $\lambda_1$ and $\lambda_2$ are selected in such a way that, when the power levels in (28)-(33b) are used, the
constraints (15)-(18) are satisfied. ■

The power levels of the cooperative codewords on each subchannel, $p_{1m}^{(i)}(\mathbf{s})$ and $p_{2n}^{(i)}(\mathbf{s})$ in (29a)
and (30a), have an interesting single user waterfilling type interpretation, as they depend solely
on the channel gains of that particular subchannel, and the fixed Lagrange multipliers. The
water level is determined by the direct link gains. However, in (32a) and (33a) the power $p_{1m}^{(i)}(\mathbf{s})$
depends on $p_{2n}^{(i)}(\mathbf{s})$, and vice versa: increasing one of the powers will decrease the other, should
the constraints (38)-(41) be satisfied with equality, and we now have a multi-user waterfilling
type solution. This is somewhat different from the observations in [8], which conjectured that a
single user waterfilling type solution for cooperative powers would be sufficient in all scenarios,
for the much simpler case of the scalar MAC, and sum rate maximization only.
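For readers less familiar with the waterfilling interpretation, the following Python sketch shows the textbook single user form, $p_i = (w - 1/g_i)^+$, with the water level $w$ found by bisection so that the power budget is met. It is a generic illustration over a finite set of parallel channels with hypothetical names; it is not the closed form (29a)-(30b), where the water level additionally involves the direct link gains and the expectation over fading states.

```python
from typing import List, Sequence


def waterfill(gains: Sequence[float], budget: float, iters: int = 100) -> List[float]:
    """Return p_i = (w - 1/g_i)^+ with sum(p) = budget, via bisection on w.

    Assumes all gains are strictly positive.
    """
    inv = [1.0 / g for g in gains]
    lo, hi = 0.0, budget + max(inv)  # the water level is bracketed by [lo, hi]
    for _ in range(iters):
        w = 0.5 * (lo + hi)
        used = sum(max(w - x, 0.0) for x in inv)
        if used > budget:
            hi = w
        else:
            lo = w
    w = 0.5 * (lo + hi)
    return [max(w - x, 0.0) for x in inv]
```

With gains `[1.0, 0.1]` and a unit budget, the water level settles at 2, so the weak channel stays dry and all power goes to the strong one.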

At this point, it should be clear that although (29a)-(30b) and (32a)-(33b) do not explicitly
depend on $p_{Uk}^{(i)}(\mathbf{s})$, the decision regarding which of these equations should be used while computing
$p_{kj}^{(i)}(\mathbf{s})$ does. Likewise, $p_{Uk}^{(i)}(\mathbf{s})$ are clearly functions of $p_{kj}^{(i)}(\mathbf{s})$, which makes equations
(29a)-(30b), (32a)-(33b) and (38)-(41) coupled. Note however that the way we proved Theorem
1 automatically suggests a natural way of solving the KKT conditions iteratively. To this end,
we propose an algorithm which performs updates on the powers of the users, one user at a time:
given $p_{U1}^{(i)}(\mathbf{s})$ and $p_{12}^{(i)}(\mathbf{s})$, it computes $p_{U2}^{(i)}(\mathbf{s})$ and $p_{21}^{(i)}(\mathbf{s})$, and using these new values for user
2, it re-iterates the powers of user 1. This algorithm simplifies the seemingly difficult task of
obtaining the optimal powers from the coupled equations, and due to the convex nature of the
problem, and the Cartesian nature of the constraints across users, it provably converges to the
optimal solution, as at the end of the iterations, the KKT conditions will be satisfied. The outline
of the algorithm is given below.



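Algorithm 1 Iterative Power Allocation Algorithm
for $\mu_2 = 0 : 1$ do
    while (15)-(16) are not satisfied do
        while (17) is not satisfied do
            Calculate $p_{1m}^{(i)}(\mathbf{s})$ using (29a)-(29b) and $p_{U1}^{(i)}(\mathbf{s})$ using (28), assuming $p_{U1}^{(i)}(\mathbf{s}) > 0$, $\forall i$
            while $\exists\, \mathbf{s}'$ s.t. $p_{U1}^{(i)}(\mathbf{s}') < 0$ do
                Set $p_{U1}^{(i)}(\mathbf{s}') = 0$ and re-calculate $p_{1m}^{(i)}(\mathbf{s}')$ using (32a)-(32b) and $p_{U1}^{(i)}(\mathbf{s}')$ using (28)
            end while
            Update $\lambda_1$
        end while
        while (18) is not satisfied do
            Calculate $p_{2n}^{(i)}(\mathbf{s})$ using (30a)-(30b) and $p_{U2}^{(i)}(\mathbf{s})$ using (28), assuming $p_{U2}^{(i)}(\mathbf{s}) > 0$, $\forall i$
            while $\exists\, \mathbf{s}'$ s.t. $p_{U2}^{(i)}(\mathbf{s}') < 0$ do
                Set $p_{U2}^{(i)}(\mathbf{s}') = 0$ and re-calculate $p_{2n}^{(i)}(\mathbf{s}')$ using (33a)-(33b) and $p_{U2}^{(i)}(\mathbf{s}')$ using (28)
            end while
            Update $\lambda_2$
        end while
        Update $\gamma_1$
    end while
end for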
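The one-user-at-a-time structure can be illustrated with a much simpler stand-in: the classical iterative waterfilling for a non-cooperative parallel Gaussian MAC, in which each user waterfills against the interference created by the other user's current powers. This Python sketch is an assumption-laden analogy with hypothetical names; it does not implement the closed forms (28)-(33b), the cooperative powers, or the multiplier updates of Algorithm 1.

```python
from typing import List, Sequence, Tuple


def waterfill(gains: Sequence[float], budget: float, iters: int = 100) -> List[float]:
    """Single user waterfilling p_i = (w - 1/g_i)^+ by bisection on the level w."""
    inv = [1.0 / g for g in gains]
    lo, hi = 0.0, budget + max(inv)
    for _ in range(iters):
        w = 0.5 * (lo + hi)
        if sum(max(w - x, 0.0) for x in inv) > budget:
            hi = w
        else:
            lo = w
    w = 0.5 * (lo + hi)
    return [max(w - x, 0.0) for x in inv]


def iterative_waterfill(g1: Sequence[float], g2: Sequence[float],
                        budget1: float, budget2: float,
                        rounds: int = 50) -> Tuple[List[float], List[float]]:
    """Alternate single user updates until the powers stop changing."""
    n = len(g1)
    p1, p2 = [0.0] * n, [0.0] * n
    for _ in range(rounds):
        # User 1 sees user 2's received power as extra noise on each subchannel.
        eff1 = [a / (1.0 + b * q) for a, b, q in zip(g1, g2, p2)]
        p1 = waterfill(eff1, budget1)
        # User 2 then reacts to user 1's freshly updated powers.
        eff2 = [b / (1.0 + a * q) for a, b, q in zip(g1, g2, p1)]
        p2 = waterfill(eff2, budget2)
    return p1, p2
```

With mirrored gains `g1 = [2.0, 0.5]`, `g2 = [0.5, 2.0]` and unit budgets, the iteration reaches a fixed point in which each user occupies only its stronger subchannel, a small-scale analogue of the exclusive subchannel use that the optimal PA can dictate.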



Perhaps the most important feature of this algorithm is that, regardless of the number of 
subchannels used, we only need to solve for three Lagrange multipliers, which relate the powers 
allocated to the subchannels, to obtain the optimum PA. This reduces the complexity of the 
algorithm dramatically, and makes it scalable, compared to the subgradient algorithm. As a 
result, the convergence is much faster. 

V. Simulation Results 

In order to obtain the optimal PA policy, and the resulting achievable rate region, we implement
the projected subgradient algorithm, and the iterative waterfilling-like algorithm based on the
KKT conditions for optimality, for a simple case with only three subchannels.
The achievable rate region for the InterSCE strategy is obtained by running each algorithm for
varying priorities $\mu_k$, and then by taking a convex hull over the resulting power optimized
regions. In Figure 2 we compare the achievable rate regions for power controlled cooperative
OFDMA utilizing the projected subgradient algorithm and the iterative algorithm, with those for
several encoding strategies without power control, from [13]. We assume that, for the channel
non-adaptive protocols, the users are still able to allocate their total power across subchannels
and codewords. The total power of each user and the noise variances are set to unity. The fading
coefficients are chosen from independent Rayleigh distributions, the means of which are shown
in Figure 2. We observe that, when the powers are chosen jointly optimally with InterSCE, there
is a major improvement in achievable rates. This unusually high gain from power control can
be attributed to our ability to take advantage of the additional diversity created by OFDMA:
PA not only allows us to use the subchannels at time varying instantaneous rates based on the
channel qualities, but also to use them adaptively for varying purposes, i.e., cooperation, common
message generation or direct transmission.

In practice, the feedback channel from the receiver to the transmitters can send only a few
bits of feedback, as otherwise a significant portion of channel resources has to be allocated to
the reverse link, which does not contribute to the channel rate. Hence, in Figure 2 we also show
the rate region achievable with limited feedback. We assume that, since the feedback is very
low rate, it is error free. When the receiver has access to the channel gains of the users, there are
two approaches one can take to feed back information to the users. A straightforward method is
to quantize the channel states, and feed back the quantized fading values on each subchannel.
Assuming a $Q$ bit quantizer is used for each fading state, the receiver should feed back a total
of $4NQ$ bits of CSI to each user, and then the users will have to look up the power levels
optimized for the quantized channel states, and use them in their transmission. An alternative
method is to compute the optimal power levels first at the receiver, and then quantize them to
obtain a quantized power codebook. Whenever a channel state is observed, the receiver can then
directly feed back the quantized powers to be used to the users. Due to the structure of the
optimal PA policy observed in Lemma 1, only two powers out of three are active for each user
$k$ at any given channel state, and which one will be active depends only on a single comparison,
$s_{kj}^{(i)} \gtrless s_{k0}^{(i)}$, which requires only one bit of feedback. Hence, the total feedback required
is $(2Q + 1)N$ bits per user, assuming $Q$ bit feedback is used for each power value. We use the
Lloyd-Max algorithm to quantize the powers, taking into account their probability distribution
induced by the underlying channel state distribution. The case with $Q = 1$ is plotted in Figure
2. A quite interesting observation is that even with one bit feedback per power component,
which is equivalent to selecting one of two possible values for each codeword's power, a large
improvement in rates can be achieved compared to the non power-controlled scenario. With two
bits of feedback per component, the achievable rate region is nearly the same as that for perfect
CSI, and is omitted to avoid confusion with the subgradient rate region.
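As a rough illustration of the power quantization step, the Python sketch below runs Lloyd's iteration on a set of sampled power values, which is the empirical counterpart of a Lloyd-Max quantizer designed for the induced power distribution. The function names, the quantile-based initialization and the handling of empty cells are assumptions; the paper does not specify these details. The second helper simply evaluates the $(2Q + 1)N$ feedback count stated above.

```python
from typing import List, Sequence


def lloyd_max(samples: Sequence[float], levels: int, iters: int = 100) -> List[float]:
    """Design a scalar quantizer codebook from samples with Lloyd's iteration."""
    xs = sorted(samples)
    n = len(xs)
    # Assumption: start from evenly spaced sample quantiles.
    codebook = [xs[(2 * j + 1) * n // (2 * levels)] for j in range(levels)]
    for _ in range(iters):
        # Nearest-neighbor condition: cell edges are midpoints between levels.
        edges = [0.5 * (codebook[j] + codebook[j + 1]) for j in range(levels - 1)]
        cells: List[List[float]] = [[] for _ in range(levels)]
        for x in xs:
            cells[sum(1 for e in edges if x > e)].append(x)
        # Centroid condition: each level moves to the mean of its cell;
        # an empty cell keeps its previous level.
        new = [sum(c) / len(c) if c else codebook[j] for j, c in enumerate(cells)]
        if new == codebook:
            break
        codebook = new
    return codebook


def power_feedback_bits(num_subchannels: int, q_bits: int) -> int:
    """Feedback per user when quantized powers are fed back: (2Q + 1) * N bits."""
    return (2 * q_bits + 1) * num_subchannels
```

For the three-subchannel setup with $Q = 1$, `power_feedback_bits(3, 1)` gives 9 bits per user per channel state.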

In Figure 2 it is also observed that the gain achieved by power control through the iterative
algorithm always exceeds that of the projected subgradient algorithm, especially in the sum rate region.
The main reason is that the subgradient algorithm had still not fully converged when it was
stopped at 10000 iterations, while the iterative algorithm did fully converge to the optimal PA.
The relative convergence times of the two algorithms are shown in Figure 3, which clearly
depicts the advantage of using the iterative algorithm over the subgradient algorithm.

In Figure 4 we compare the rate regions in a uniform fading environment with means
expressed on the figure. Here we ensure $\mathbf{s} \in \mathcal{S}_1$, with the motivation of obtaining a strictly
optimal PA, and a simpler description of the power distributions. In this setting, since some
of the power values are always zero, the number of power variables is smaller, and hence the
subgradient algorithm nearly converges to the optimum within 10000 iterations, and the rate
regions of the subgradient and iterative algorithms nearly coincide. For this setting, we further
analyze the optimal power distributions over the channel states, in Figures 5(a)-5(c), 6(a)-6(c)
and 7(a)-7(b).



Figures 5(a)-5(c) and 6(a)-6(c) demonstrate the optimal powers allocated to subchannel 1, as 
functions of the inter-user link gains, when the direct link gains are fixed to two different sets 
specified on the figures. Powers $p_{U_2}^{(1)}$ are not shown, to save space, as they are identical 
to $p_{U_1}^{(1)}$ due to the symmetry in fading. In Figures 5(a)-5(c), the direct link gains are at 
their maximum, hence the cooperative powers, $p_{U_k}^{(1)}$, are always positive. In this case, we 
observe the expected single user waterfilling type behavior for the distributions of 
$p_{12}^{(1)}(s)$ and $p_{21}^{(1)}(s)$. In Figures 6(a)-6(c) however, when the direct links are 
moderate on the average, we have a more interesting scenario: when $s_{21}^{(1)}$ 
is significantly stronger instantaneously, only user 2 uses the subchannel. When both inter-user 
links are instantaneously strong, the users exchange information using simultaneous waterfilling, 
and set $p_{U_k}^{(1)}$ to zero. When both inter-user links are weak, the users use the subchannel 
solely to convey common information to the RX, by using only $p_{U_1}^{(1)}$ and $p_{U_2}^{(1)}$. 
An important observation is that, although we make no prior assumptions on subchannel allocation 
to users/codewords, the optimal powers sometimes dictate exclusive use of some subchannels for 
dedicated tasks. The resulting power distributions show that the KKT conditions are indeed 
satisfied at the fixed point of our iterative algorithm, verifying convergence. 
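
The single user waterfilling behavior mentioned above can be illustrated with a minimal Python sketch. It solves the textbook single-user problem (maximize $\sum_i \log(1 + g_i p_i)$ under a sum power budget) by bisection on the water level, and then evaluates the marginal rates to check the optimality condition. The function names, the gains and the budget are hypothetical choices for illustration. This is not the cooperative iterative algorithm of this paper.

```python
def waterfill(gains, total_power, tol=1e-12):
    """Single-user waterfilling: p_i = max(mu - 1/g_i, 0) with sum(p) = total_power."""
    inv = [1.0 / g for g in gains]
    # Bisection on the water level mu; at the upper bound the budget is exceeded.
    lo, hi = 0.0, total_power + max(inv)
    while hi - lo > tol:
        mu = 0.5 * (lo + hi)
        used = sum(max(mu - v, 0.0) for v in inv)
        if used > total_power:
            hi = mu
        else:
            lo = mu
    return [max(lo - v, 0.0) for v in inv]

def marginal_rates(gains, powers):
    """Derivative of log(1 + g*p) with respect to p, per channel."""
    return [g / (1.0 + g * p) for g, p in zip(gains, powers)]

# Hypothetical example: two good channels and one poor channel.
gains = [2.0, 1.0, 0.1]
powers = waterfill(gains, total_power=1.0)
slopes = marginal_rates(gains, powers)
print(powers)  # the poor channel receives no power
print(slopes)  # equal on active channels, smaller on the unused one
```

Channels that receive positive power share the same marginal rate, while the unused channel has a strictly smaller one. This is the same equality or inequality structure that the KKT conditions impose on the powers in the cooperative setting.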



In Figures 7(a)-7(b), we plot the power distributions obtained using the subgradient algorithm 
instead, for the same setting as in Figures 6(a)-6(c). The subgradient algorithm is terminated 
after 10000 iterations. It is observed that while the powers $p_{12}^{(1)}(s)$ and 
$p_{21}^{(1)}(s)$ seem to have nearly converged to the optimal values shown in Figures 6(a)-6(c) 
(only $p_{12}^{(1)}(s)$ is shown, as $p_{21}^{(1)}(s)$ is simply symmetrical), the cooperative 
power $p_{U_1}^{(1)}(s)$ has still not fully converged, though it is close to its optimal 
distribution. Note that the effect of this on the rate regions is negligible, as was shown in 
Figure 4. 



VI. Conclusion 

We obtained the optimum PA policies for a mutually cooperative OFDMA channel employing 
IntraSCE and InterSCE strategies. We developed a subgradient algorithm and a more efficient 
iterative algorithm which maximize the achievable rate region. The number of iterations of the 
iterative algorithm does not depend on the number of subchannels, which makes the algorithm 
scalable. We demonstrated that the optimal PA may also serve as a guideline for subchannel 
assignment to the users' cooperative codewords, and that PA for cooperative OFDMA provides 
significant rate improvements, even in limited feedback scenarios, due to its ability to exploit 
the diversity provided by OFDMA. 

VII. Appendix 



Note that KKT conditions are necessary and sufficient for optimality. To obtain the KKT 
conditions we first assign the Lagrange multipliers $\gamma_1$, $\gamma_2$, $\lambda_1$ and 
$\lambda_2$ to the inequality constraints (15), (16), (17), (18) respectively, and we further 
assign $\epsilon_t(s)$, $t = 1, \ldots, 6$, $\forall s$ to the positivity constraints (19), to 
obtain the Lagrangian $\mathcal{L}$ in (42), whose last six terms are the products of each 
$\epsilon_t(s)$ with its associated power. 



For $s \in \mathcal{S}_1 \cup \mathcal{S}_2 \cup \mathcal{S}_3$, we take partial derivatives of the 
Lagrangian function $\mathcal{L}$ with respect to $p_{12}^{(i)}(s)$, $p_{21}^{(i)}(s)$ and 
$p_{U_k}^{(i)}(s)$, $\forall s$, to obtain the respective conditions (43)-(45), 
where $e_1 \in \{1, 2\}$, $e_2 \in \{4, 5\}$ and $e_3 \in \{3, 6\}$ take their values based on the 
power with respect to which the derivative is taken. Likewise, for $s \in \mathcal{S}_4$, the 
respective partial derivatives yield (46)-(48). 
Since the optimal PA policy should satisfy the complementary slackness constraints (49), 



we can either drop $\epsilon_t(s)$ in each of (43)-(48), if the corresponding power is positive; or 
we can replace the equality by a strict inequality, meaning that $\epsilon_t(s)$ is non-zero but 
its corresponding power is zero. Hence, using the relevant conditions from (49) in (43)-(48), and 
dropping the dependencies on $\epsilon_t(s)$, we write the conditions for optimality in terms of 
inequalities instead, which yield (22)-(27). The inequalities hold with equality if and only if the 
corresponding power level is positive, and with strict inequality if that power level is zero. 
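
As an illustration of this step, consider the textbook single-user analogue, which is not the cooperative problem of this paper: maximize a sum of log rates over channels with gains $s_i$ under a sum power constraint $P$. The symbols $s_i$, $p_i$, $\lambda$, $\epsilon_i$ and $P$ below are assumptions made for this example only. The sketch shows how dropping the positivity multipliers turns the stationarity equalities into inequalities.

```latex
\begin{align*}
\mathcal{L} &= \sum_i \log(1 + s_i p_i) - \lambda \Big( \sum_i p_i - P \Big) + \sum_i \epsilon_i p_i, \\
\frac{\partial \mathcal{L}}{\partial p_i} &= \frac{s_i}{1 + s_i p_i} - \lambda + \epsilon_i = 0,
\qquad \epsilon_i p_i = 0, \quad \epsilon_i \ge 0, \\
\Rightarrow \quad \frac{s_i}{1 + s_i p_i} &\le \lambda,
\quad \text{with equality if } p_i > 0, \\
\Rightarrow \quad p_i &= \Big( \frac{1}{\lambda} - \frac{1}{s_i} \Big)^{+}.
\end{align*}
```

Here $\epsilon_i = 0$ whenever $p_i > 0$, which gives the equality, while $\epsilon_i > 0$ forces $p_i = 0$ and makes the inequality strict. The conditions (22)-(27) have the same structure, with the cooperative rate expressions in place of the single-user rate.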



Partial derivatives with respect to the dual variables dictate that the conditions (15)-(18) are 
satisfied. Finally, the remaining partial derivative yields $\gamma_1 + \gamma_2 = 1$, hence the 
condition 

$\gamma_1 = 1 - \gamma_2$. 

Fig. 1. Gaussian cooperative OFDMA channel. 
Fig. 2. Achievable rate regions in Rayleigh fading. 
Fig. 3. Comparison of the convergence times of the proposed algorithms in Rayleigh fading. 
Fig. 4. Achievable rate regions in uniform fading. 




(a) Power level, $p_{12}^{(1)}$. (b) Power level, $p_{21}^{(1)}$. 

Fig. 6. Optimal power allocation when $s_{10}^{(1)} = s_{20}^{(1)} = 0.15$, fixed and always less 
than $s_{12}^{(1)}$ and $s_{21}^{(1)}$. When $p_{U_k}^{(1)}$ is positive, $p_{12}^{(1)}$ and 
$p_{21}^{(1)}$ obey single user waterfilling. As the inter-user links get stronger, it becomes more 
profitable to create common information, $p_{U_k}^{(1)}$ become 0, and the users perform 
simultaneous waterfilling. 

Fig. 7. Power allocation obtained after 10000 iterations of the subgradient algorithm, when 
$s_{10}^{(1)} = s_{20}^{(1)} = 0.15$, fixed and always less than $s_{12}^{(1)}$ and $s_{21}^{(1)}$. 
The algorithm has not yet converged to the optimum value, despite a much longer running time 
compared to the iterative algorithm. Achievable rates are nearly within 0.1% of the optimum value. 



